What is claimed is: 
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1 . A computerized method of automatically generating from a first speech 
recognizer a second speech recognizer, said first speech recognizer comprising a first 
acoustic model with a first decision networl< and corresponding first phonetic contexts, 
and said second speech recognizer being adapted to a specific domain, said method 
comprising: 

based on said first acoustic model, generating a second acoustic model with a 
second decision network and corresponding second phonetic contexts for said second 
speech recognizer by re-estimating said first decision network and said corresponding 
first phonetic contexts based on domain-specific training data. 

2. The method of claim 1 , wherein said domain-specific training data is of a limited 
amount only. 

3. The method of claim 1 , said re-estimating comprising: 

partitioning said training data using said first decision network of said first speech 
recognizer. 

4. The method of claim 3, said partitioning step comprising: 

passing feature vectors of said training data through said first decision network 
and extracting and classifying phonetic contexts of said training data. 

5. The method of claim 4, said re-estimating further comprising: 

detecting domain-specific phonetic contexts by executing a split-and-merge 
methodology based on said partitioned training data for re-estimating said first decision 
network and said first phonetic contexts. 
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6. The method of claim 5, wherein control parameters of said split-and-merge 
methodology are chosen specific to said domain. 

7. The method of claim 5, wherein for Hidden-Markov-Models (HMMs) associated 
with leaf nodes of said second decision network, said re-estimatIng comprises re- 
adjusting HMM parameters corresponding to said HMMs. 

8. The method of claim 7, wherein said HMMs comprise a set of states s,- and a set 

of probability-density-functions (PDFS) assembling output probabilities for an 
observation of a speech frame in said states s,., and wherein said re-adjusting step is 

preceded by: 

selecting from said states s,. a subset of states being distinctive of said domain; 

and 

selecting from said set of PDFS a subset of PDFS being distinctive of said 
domain. 

9. The method of claim 7, wherein said method is executed iteratively for additional 
training data. 

1 0. The method of claim 8, wherein said method is executed iteratively for additional 
training data. 

1 1 . The method of claim 7, wherein said first and said second speech recognizer are 
general purpose speech recognizers. 



12. The method of claim 7, wherein said first and said second speech recognizers 
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are speaker-dependent speech recognizers and said training data is additional speaker- 
dependent training data. 

13. The method of claim 7, wherein said first speech recognizer is a speech 
recognizer of at least a first language and said domain specific training data relates to a 
second language and said second speech recognizer is a multi-lingual speech 
recognizer of said second language and said at least first language. 

14. The method of claim 1 , wherein said domain is selected from the group 
consisting of a language, a set of languages, a dialect, a task area, and a set of task 
areas. 

15. A machine-readable storage, having stored thereon a computer program having 
a plurality of code sections executable by a machine for causing the machine to 
automatically generate from a first speech recognizer a second speech recognizer, said 
first speech recognizer comprising a first acoustic model with a first decision network 
and corresponding first phonetic contexts, and said second speech recognizer being 
adapted to a specific domain, said machine-readable storage causing the machine to 
perform the steps of: 

based on said first acoustic model, generating a second acoustic model with a 
second decision network and corresponding second phonetic contexts for said second 
speech recognizer by re-estimating said first decision network and said corresponding 
first phonetic contexts based on domain-specific training data. 

16. The machine-readable storage of claim 15, wherein said domain-specific training 
data is of a limited amount only. 

17. The machine-readable storage of claim 15, said re-estimating comprising: 
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partitioning said training data using said first decision network of said first speech 
recognizer. 

18. The machine-readable storage of claim 17, said partitioning step comprising: 
passing feature vectors of said training data through said first decision network 

and extracting and classifying phonetic contexts of said training data. 

19. The machine-readable storage of claim 18, said re-estimating further comprising: 
detecting domain-specific phonetic contexts by executing a split-and-merge 

methodology based on said partitioned training data for re-estimating said first decision 
network and said first phonetic contexts. 

20. The machine-readable storage of claim 19, wherein control parameters of said 
split-and-merge methodology are chosen specific to said domain. 

21 . The machine-readable storage of claim 19, wherein for Hidden-Markov-Models 
(HMMs) associated with leaf nodes of said second decision network, said re-estimating 
comprises re-adjusting HMM parameters corresponding to said HMMs. 

22. The machine-readable storage of claim 21 , wherein said HMMs comprise a set 
of states Sj and a set of probability-density-functions (PDFS) assembling output 

probabilities for an observation of a speech frame in said states s^, and wherein said re- 
adjusting step is preceded by: 

selecting from said states s,- a subset of states being distinctive of said domain; 

and 

selecting from said set of PDFS a subset of PDFS being distinctive of said 
domain. 



P1021048;2 



EXPRESS MAILING LABEL NO. EK972214875US 

22 



Docket No. DE9-2000-0040 (269) 



23. The machine-readable storage of claim 21 , wherein said method is executed 
iteratively for additional training data. 

24. The machine-readable storage of claim 22, wherein said method is executed 
iteratively for additional training data. 

25. The machine-readable storage of claim 21 , wherein said first and said second 
speech recognizer are general purpose speech recognizers. 

26. The machine-readable storage of claim 21 , wherein said first and said second 
speech recognizers are speaker-dependent speech recognizers and said training data 
is additional speal<er-dependent training data. 

27. The machine-readable storage of claim 21 , wherein said first speech recognizer 
is a speech recognizer of at least a first language and said domain specific training data 
relates to a second language and said second speech recognizer is a multi-lingual 
speech recognizer of said second language and said at least first language. 

28. The machine-readable storage of claim 15, wherein said domain is selected from 
the group consisting of a language, a set of languages, a dialect, a task area, and a set 
of task areas. 
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